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WHAT IS CLAIMED IS; 



l\ A method comprising: 

segmenting video data to create a video clip based 
on timing \lata; and 

determining a most likely gesture in the video clip, 

2. The mWthod of claim 1, wherein determining includes 
determining a probability that each of a plurality of 
predefined gestures\are performed in the video clip contains 
the predefined gestures 

3. The method of c^aim 2, wherein determining the 
probability that the video Vlip contains each of the 
predefined gesture includes e\aluations of Hidden Markov 
Models . 



4. The method of claim 1, wKerein the timing data 
includes beat data corresponding to beat of audio data, 

5. The method of claim 4, furthe\ comprising : 
receiving the audio data; and 

extracting the beat data from the\audio data, 

6. The method of claim 4, wherein the ^deo clip 
includes a portion of the video data corresponding to a 
predefined time window surrounding the occurrence \pf at least 
one beat. 
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1 A- The method of claim 1, further comprising displaying 

2 a targe^ gesture to be performed by the subject of the video 

3 data. 

1 8. Tl^e method of claim 1, wherein each video clip 

2 contains videa frames 
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9. The method of claim 1, further comprising 
identifying moving Iregions in each video frame in the video 
clip . 

10. The method of Vlaim 9, further comprising generating 
a feature vector for each Video frame of the video clip. 



11. The method of claiirv 1, further comprising generating 
a score based on whether the v\deo clip contains the target 
gesture . 



12. The method of claim 11, further comprising 
displaying the score. 

13. The method of claim 1, wherei^ determining if the 
video clip contains the predefined gestur\ includes generating 
a gesture probability vector having a plurality of elements, 
each element being associated with one of a- plurality of 
predefined gestures and representing a probability that the 
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6 video clip contains each of the associated predefined 

7 gestures. \ 
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14. A\system comprising: 

a temporal segmentor connected to receive video data and 
to create a vid^o clip from the video data based on timing 
data; and 

a recognition^ngine, in communication with the temporal 
segmentor, to determine if the video clip contains a 
predefined gesture . 



N^im 14, wherein the recogniti 



15. The system of 
engine includes a plurality^ of Hidden Markov Models, 



on 



1 16. The system of claim\l4, further comprising: 

2 a timing data source, in communication with the temporal 

\ 

3 segmentor, to provide the timing data to the temporal 

4 segmentor; and 

5 a video source, in communication with the temporal 

6 segmentor, to provide the video data to v the temporal 

7 segmentor. 
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17. The system of claim 14, further comprising a move 

\ 

subsystem, in communication with the timing d^ata source, to 
provide a target gesture to be performed by thew subject of the 



video data, 
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1 18\ The system of claim 17, wherein the target gesture 

2 is a dancie move that is to be performed by the subject of the 

3 video dataX 

1 19. TheXsystem of claim 17, further comprising a scoring 

2 subsystem, in communication with the recognition engine and 

3 the move subsystera, to determine if the video clip contains 

4 the target gesture A 

1 20. The system asE claim 19, further comprising a display 

2 subsystem, in communication with the scoring subsystem, to 

3 display a score that is a function of whether the video clip 

4 contains the target gestureX 

1 21. The system of claim ^0, wherein the display 

2 subsystem is in communication wi\h the move subsystem and is 

3 configured to display a gesture request based on the target 

4 gesture. X 

1 22. The system of claim 14, wherein the recognition 

2 engine is configured to recognize predefined gestures and to 

3 produce a gesture probability vector having, elements, each 

4 element being associated with one of the predefined gestures 

5 and representing the probability that the vide\ clip contains 

6 the associated predefined gesture. X 
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1 2^. The system of claim 14, wherein the timing data 

2 source iVcludes: 

3 an audio source that provides an audio data; and 

4 a beat \extractor , in communication with the audio source, 

5 that extracts\beat data from the audio data. 

1 24. The system of claim 23, wherein the video clip 

2 corresponds to a beat in the beat data. 

1 25. The system of claim 24, wherein the video clip 

2 includes a portion of tnk video data corresponding to a 

3 predefined time window surrounding the occurrence of the beat. 

1 26. A computer program Woduct, tangibly stored on a 

2 computer-readable medium, for recognizing gestures contained 

3 in video data, comprising instructions operable to cause a 

4 programmable processor to: 

5 segment the video data to create a video clip based 

6 on timing data; and 

7 determine if the video clip contains a predefined 

8 gesture. 

1 27. The product of claim 26, further coftrprising 

2 instructions operable to cause the programmableX processor to: 

3 extract beat data from an audio signal; and 
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4 segment the video data to create the video clip using the 

5 beat data.N 
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28. An \audio-visual processing system including: 
a video source to provide video data; 
an audio soVrce to provide audio data; 

a speaker to play at least a portion of the audio data; 



and 



a computer program product, tangibly stored on a 
computer-readable medium*, for recognizing gestures contained 
in video data, comprisingXinstructions operable to cause a 
programmable processor, in communication with the video source 
and the audio source, to: 

extract beat data froni the audio data; 
segment the video data \o create a video clip based 
on beat data; and 

determine if the video clip^contains a predefined 
gesture . 

29. The video processing system of Vlaim 28, wherein the 
computer program product further includes instructions 
operable to cause the programmable processor Vo: 

perform a Hidden Markov Model process to determine if the 
video clip contains the predefined gesture. 
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30. The video processing system of claim 28, further 
comprising a displSay to display information based on whether 
the video clip contains the predefined gesture. 
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